Picture for Chien-Sheng Wu

Chien-Sheng Wu

Agentic Uncertainty Quantification

Add code
Jan 22, 2026
Viaarxiv icon

From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models

Add code
Jan 22, 2026
Viaarxiv icon

Agentic Confidence Calibration

Add code
Jan 22, 2026
Viaarxiv icon

The Need for a Socially-Grounded Persona Framework for User Simulation

Add code
Jan 12, 2026
Viaarxiv icon

MMPersuade: A Dataset and Evaluation Framework for Multimodal Persuasion

Add code
Oct 26, 2025
Viaarxiv icon

GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness

Add code
Oct 01, 2025
Viaarxiv icon

CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions

Add code
May 24, 2025
Viaarxiv icon

AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation

Add code
Apr 10, 2025
Viaarxiv icon

BingoGuard: LLM Content Moderation Tools with Risk Levels

Add code
Mar 09, 2025
Viaarxiv icon

Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents

Add code
Feb 24, 2025
Viaarxiv icon